Golang Job: Staff Observability Engineer (SRE) - open to remote across ANZ

Company

Canva

Location

Sydney - Australia

Job type

Full-Time

Golang Job Details

Canva’s Commitment and Mission

At Canva, we celebrate diversity. We deeply believe that bringing together diversity of thoughts, perspectives and expression is key to building the best product, team and company. We look for many different skills and abilities, as well as how you can enhance Canva and our culture. So, even if you don’t think you quite meet all of the skills listed or tick all the boxes, we’d still love to hear from you!

Our mission at Canva is to empower the world to design. Since launching in 2013, we have grown exponentially, amassing over 110 million monthly active users across 190 countries and a team of over 3,400 people… and the best bit is that we’ve only achieved 1% of what we know we’re capable of.

Join us and design your future.

The Observability Team is responsible for all operational insights inside Canva. Our goal is to provide our development team with world-class tools to view how their services are performing in production. We achieve this by combining industry-leading third-party solutions with our own in-house developed solutions.

We work across the entire stack, maintaining our metrics and logging Java libraries, our logging and metrics infrastructure, error and alert routing, and our incident management process, to name just a few. As we scale, all of these areas require more sophisticated solutions to ensure that Canva developers can continue to grow without compromising on reliability or availability.

What you'll do (Responsibilities)

  • As a technical lead, you will guide the design and implementation of some of our major projects like scaling our Logging, Metrics and Tracing tooling.
  • Help identify gaps in Observability tooling at Canva and help take it to the next level.
  • Be proactive in modernising our Observability tooling, and participate in brainstorming and building proofs of concept to determine how we can improve our existing systems
  • Become an authority in the area of Observability and evangelise the benefits of accurate instrumentation throughout the organisation
  • Help build new features into our Logging and Alerting pipelines to ensure that people are getting notifications for errors that are related to their domains
  • Help develop solutions to difficult problems that our existing tooling can’t solve, such as handling high-cardinality metrics
  • Guide and mentor other people on the team and help them grow their skills and knowledge

Required skills & experience

  • Comfortable and happy to code in Python, Java or Golang
  • Deep knowledge and understanding of Computer Engineering fundamentals and first principles
  • Experience building and running monitoring infrastructure at a large scale. For example, Terabyte-scale Elasticsearch clusters, Prometheus, Kibana, Grafana, etc
  • Experience working with and monitoring microservice architectures in large distributed cloud environments (ideally AWS).
  • Experience with Observability tooling – having proficiency with tools like Datadog, Elasticsearch, Kibana, Loggly, Sentry, or similar
  • Experience with infrastructure-as-code; we’re a Terraform shop, but strong experience with other IaC tools will do the trick
  • The ability to guide and mentor other members within the team and improve the way we collaborate, learn, and share ideas
  • This role requires a lot of documentation and alignment among internal team members, so strong written and verbal communication skills are required

Not essential, but helpful experience!

  • Experience writing application code in Java, since we also maintain the metrics, tracing and logging Java libraries.
  • Experience with OpenTelemetry or writing code in Golang, since we are adopting OpenTelemetry in a lot of the tooling we provide.

Perks and Benefits

  • Competitive salary, plus stock options via our ESOP
  • Flexible daily working hours; we value work-life balance
  • Breakfast and lunch prepared by our wonderful Vibe team
  • Onsite Gym and Yoga Membership
  • End-of-Trip Facilities: Bicycle parking and showers
  • Generous parental (including secondary) leave policy
  • Pet-friendly offices
  • Internal Coaches and an Employee Wellness Program
  • Sponsored social clubs, team events, and celebrations
  • Relocation budget for interstate or overseas individuals (including visa sponsorship)

Working at Canva

Our culture is unlike anywhere else and we design your #CanvaLife experience to empower you to do the best work of your life.

Whether you’re in the office, working from home or choosing your own adventure, our benefits for permanent Canvanauts include:

  • Equity packages for you to truly be a part of the Canva journey.
  • We have a hybrid work model (in-office & from home), with our offices always open to you, balancing flexibility and connection.
  • Flexible leave so you can recharge, give back, support others or focus on your own professional development.
  • Inclusive parental leave policy that supports all parents and carers throughout their parenting and caring journey.
  • An annual Vibe & Thrive allowance. This is for you to spend on whatever will support your wellbeing and development, because you know what you need to Vibe and Thrive better than anyone.
  • Virtual and in-office wellness benefits including Canva University, Employee Assistance Programs and Fitness & Meditation Classes.
  • Canva For Good program matching your not-for-profit donations, Force for Good leave (3 paid volunteering days) and a range of sustainability and ethical initiatives to get involved in.

We make hiring decisions based on your experience, skills and passion. Please note that interviews are conducted virtually. When you apply, please tell us the pronouns you use and any reasonable adjustments you may need during the interview process.

Staff Observability Engineer (SRE) - open to remote across ANZ